Assessing the Utility of Automatic Cancer Registry Notifications Data Extraction from Free-Text Pathology Reports

نویسندگان

  • Anthony N. Nguyen
  • Julie Moore
  • John O'Dwyer
  • Shoni Colquist
چکیده

Cancer Registries record cancer data by reading and interpreting pathology cancer specimen reports. For some Registries this can be a manual process, which is labour and time intensive and subject to errors. A system for automatic extraction of cancer data from HL7 electronic free-text pathology reports has been proposed to improve the workflow efficiency of the Cancer Registry. The system is currently processing an incoming trickle feed of HL7 electronic pathology reports from across the state of Queensland in Australia to produce an electronic cancer notification. Natural language processing and symbolic reasoning using SNOMED CT were adopted in the system; Queensland Cancer Registry business rules were also incorporated. A set of 220 unseen pathology reports selected from patients with a range of cancers was used to evaluate the performance of the system. The system achieved overall recall of 0.78, precision of 0.83 and F-measure of 0.80 over seven categories, namely, basis of diagnosis (3 classes), primary site (66 classes), laterality (5 classes), histological type (94 classes), histological grade (7 classes), metastasis site (19 classes) and metastatic status (2 classes). These results are encouraging given the large cross-section of cancers. The system allows for the provision of clinical coding support as well as indicative statistics on the current state of cancer, which is not otherwise available.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Extraction of ICD-O-3 Primary Sites from Cancer Pathology Reports

Although registry specific requirements exist, cancer registries primarily identify reportable cases using a combination of particular ICD-O-3 topography and morphology codes assigned to cancer case abstracts of which free text pathology reports form a main component. The codes are generally extracted from pathology reports by trained human coders, sometimes with the help of software programs. ...

متن کامل

بررسی میزان کامل‌شماری ثبت سرطان مری در داده‌های ثبت سرطان مبتنی بر جمعیت در استان اردبیل

Background and Objectives: completeness of registration is used as one of the measures of the quality of a cancer registry, which is the degree to which reportable incident cases of cancer in the population of interest is actually recorded in the registry. Methods: After removing the duplicates, a total of 471 new cases of esophagus cancer reported by three sources of pathology reports, medi...

متن کامل

Quality assessment of the registration of vulvar and vaginal premalignant lesions at the Cancer Registry of Norway

BACKGROUND A crucial factor concerning the utility of Cancer Registries is the data quality with respect to comparability, completeness, validity and timeliness. However, the data quality of the registration of premalignant lesions has rarely been addressed. High grade vulvar intraepithelial neoplasia (VIN) and vaginal intraepithelial neoplasia (VaIN) are premalignant lesions which may develop ...

متن کامل

نظام ثبت سرطان بیمارستانی در ایران و مقایسه آن با آمریکا

Introduction: Cancer research is one of the essential activities for its control and treatment. Hospital based cancer registry system is an information system designed to collect, organize and analyze data on cancer. The objective of the present study was to compare hospital based cancer registry system in Iran with that in the USA. Methods: This research was a comparative study. Studied popul...

متن کامل

Creating a rule based system for text mining of Norwegian breast cancer pathology reports

National cancer registries collect cancer related information from multiple sources and make it available for research. Part of this information originates from pathology reports, and in this pre-study the possibility of a system for automatic extraction of information from Norwegian pathology reports is investigated. A set of 40 pathology reports describing breast cancer tissue samples has bee...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AMIA ... Annual Symposium proceedings. AMIA Symposium

دوره 2015  شماره 

صفحات  -

تاریخ انتشار 2015